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1. Introduction 

We begin our discussion of a general Bayesian approach to making 
inferences about change-points by considering a simple special case. 

Suppose that yi,...,y„ are realizations of a random process such that, in 
the absence of any change in underlying structure, the joint density of 
>'i» • • would have the form 

p(yu''.yn\^i)='^^p{yt\*il (i) 

where is a vector of parameters. In such a case, we say that the process 
exhibits a change-point at time r (lgr<«) if there exists i^ii^i^i) such 
that 

=npO'*ki) n p(yik2). (2) 

[The notation p(* | *) is simply a generic symbol for a conditional density; its 
actual interpretation will always be clear from the context.] 

We shall denote by Af, the model which assumes there to be a change 
point at r. In the case of known ^i, we therefore have 

P(yi,..M>'„|MJ=p(yi....,y„|r,^^i,^2), (3) 

♦The material in section 4 is based on Chapter 2 of the first author's PhJ5. thesis written at 
University College. London, under the supervision of the second author. 
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given by (2), whereas, if ^2 unknown, we have 

p(y,....y.|A^J=^Jp(y,....y.|^^,.^Jp(*l.^2)#l<i*2. 



(4) 



where p{^x,if^ specifies a prior density for ^^j, ^2 (^^re assumed independ- 
ent of M^y It is convenient to denote by Mq the model which assumes that 
no change in underlying distribution has occurred In the case of unknown 
^^j, we therefore have 



(5) 



whereas, in the case of known ^^1. the left-hand side of (5) is given directly by 
eq. (1). 

Given yu-.^yn inferences about the change-point are equivalent to in- 
ferences about the alternative models Mq, Afi,..., Afo_,. The basis for such 
inferences is provided straightforwardly by Bayes' theorem, which takes the 
form 



p( Af , I y 1 . . . y J oc p(y 1 . . . „ y„ I M ,) 



(6) 



where pCM,) denotes the prior probability of a change-point occurring at r. 

Thus, for example, comparison of the posterior probabilities of two 
alternative hypotheses M,, M„ say, is given by the ratio of the two terms (for 
r and s, respectively) provided by (6). A convenient way of re-expressing this 
is in the form 



yu'-^y^lfi^r) p(yi 



yi»..-»yj/ p(yi 



Mr) 



My 



(7) 



displaying the ratio of posterior to prior odds on against M, as a 
(usually, integrated) likelihood ratio. The right-hand side of (7) is called the 
Bayes factor for against and will be denoted by B„. A value of B„> 1 
thus indicates evidence from the data pointing towards r as a more likely 
change-point than s. In particular, if s = 0, B^q provides an indicator of 
whether a change-point at r is better supported than an assumption of no 
change. Since Bor-B^^ and J8„=Bh)Bo„ it will suffice, in what follows, to 
work in terms of B^. 

If we are interested in an overall assessment of change versus no change, 
we note that the appropriate ratio of posterior to prior odds is given by 

f\-P{Mo\y„..^y^\ l fl-nMo)\ ^„ { P(M,) \ ... 
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The dght-hand side of (8) has the form of a weighted average of individual 
Bayes factors for changes at specific times, weighted by the a priori 
probabilities of those changes, conditional on a change having occurred. In 
particular, if the F(M,)/(l-P(Mo)) terms are equal over the range of r for 
which they are non-zero, then (8) is just an average of the Bayes factors B^, 
taken over all the possible change-points. 

We began our discussion by considering a simple structure, summarized in 
(1H5). This will be illustrated in section 2, where we consider changes of 
means in uni- and multivariate normal sequences. For more general pro- 
blems of changes in structure of regression or time series models (sections 3 
and 4), (1H5) no longer apply, but, provided we can specify the form 
ptXi,..^^^! Af^) occurring in (6), the forms (6H8) still provide the basis for 
inferences about r. In addition to inferences about r, it is straightforward 
within the Bayesian approach to provide inference statements about other 
unknown parameters occurring in the specification of model structures. 

Detailed Bayesian analysis of parameter shifts in univariate series have 
been given in Broemeling (1972, 1974), Smith (1975), Lee and Heghinian 
(1977) and Menzefricke (1981). Some aspects of this work will be summarized 
briefly in section 2, but in this paper we shall concentrate mainly on the 
regression and time series cases (sections 3 and 4). We have reported 
previously on these problems in Smith (1977), Smith and Cook (1980) and 
Smith (1980), but we shall be concerned here with developing our previous 
ideas in two new important directions. First, we provide a possible solution 
to the problem of comparing the no-change model with a model which 
assumes change, under the assumption of vague prior information for the 
parameters within each model. Since the models being compared have 
different dimensions, we are forced to confront directly the problem of 
assigning values to the (usually ignored) arbitrary constants arising in the 
specification of forms of improper prior density. Our basic suggestion for 
overcoming the problem will be introduced in section 2. Secondly, we 
provide a unified account of a procedure which greatly facilitates the 
treatment of the time series case by relating it to a regression problem. The 
technique used is closely related to the problem of finding the exact 
likelihood for ARMA processes. 

Throughout, we shall be concerned with retrospective inference about 
change. For Bayesian approaches to on-line monitoring, see, for example, 
Harrison and Stevens (1976), Smith and Makov (1980) and Smith and West 
(1982). 

1 Changes of mean in nonnal sequences 

2,L The tmivariate case 

Bayesian inference about a change of mean in a univariate normal 
sequence is dealt with in detail by Smith (1975. sec. 4), and the change of 
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variance case is treated by Menze&idce (1981). In this section, we shall just 
consider the problem of comparing models of change of mean versus no 
change when there is vague prior Icnowledge about the model parameters. 
We begin with the simple case of a conditionally independent univariate 
normal sequence and then generalize to the multivariate case. This will 
enable us to highlight the problems which arise in calculating Bro> and hence 
(8), when standard nnproper prior forms of representation of vague know- 
ledge are assumed. 

Corresponding to (2), with ^j={^j,a% we shall assume that, for Af^ with 

Further, we shall assume a prior density pC^i^^J of the form 

p0^u^2.<^)=P(Mi\ <r) pO* 2 1 pip) 

= {c,(2na^)-*}{c,(2n<72)-*}ca-^ 



(9) 



where c^.Ci^c are unspediied constants [see, for example, Jeffreys (1961)]. 
It then follows from (4), after some straightforward algebra, that 

p(yi,....yjM,) 

I(yi->J'+ I (yr-y(.-r))M . 

i-1 i=7+i J 



(10) 



where >'r,}'oi-,) denote the mean of yi....,y, and y,+i,...,yfl, respectively. In 
the case of Mq, we take p(4fi)=p{Mi \ o)^<j) and obtain 



Viyi y,|Afo)occiC 



r B ^^-n^2 



(11) 



We now note that if we take the ratio of (10) and (11) in order to obtain 
from (7), we obtain, afler some algebra. 



(12) 
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where <f is the usual r-statistic for testing the hypothesis /i^ =/i2, given the 
sampling distribution defined by (9). The form of (12) thus shows a functional 
connection between B^, and hence (8), and classical test statistics for 
comparing means. In particular, is monotonic increasing with t^, and a 
minimum (i.e., showing least evidence of a change at r) when t^=0 
(corresponding to y,=y(^-,)). 

However, the unpleasant - and unavoidable - feature of (12) is that it 
depends on the unspecified constant €2- Note that this problem does not 
arise if we calculate say, since, when we form the ratio of two terms 
defined by (10), the factor C1C2C cancels. Essentially, when we compare 
and Afo, assuming vague prior information, we are dealing with models of 
different dimensionahty (two unknown parameters under Mq versus three 
under M,) and arbitrariness necessarily enters into a Bayes factor derived, 
implicitly, from improper limits of proper priors [see, for example, Lempers 
(1971) and Atkinson (1978)]. 

In order to motivate a choice for the value of in (12), we use the idea of 
a 'thought experiment', introduced and developed in detail in Smith and 
Spiegelhalter (1981) and Spiegelhalter and Smith (1982). 

We imagine that we have observed a sample of minimal size to enable us 
to compare Mq and M„ and that this sample has provided maximal evidence 
in favour of M,. This corresponds to assuming that n=r+l and t^=0. In 
such a case, we should wish to have B^<1 (evidence favours Mq), but (since 
we have only one observation following r) we should not wish B^ to differ 
much from 1. If we enter the imagined values of n and tf, and take B^'-^l to 
represent our unwillingness to regard such a sample as providing other than 
very minimal evidence, we can deduce immediately from (12) the unspecified 
choice of Cj. 

In fact, we have assumed in (9) that the prior specification for p{iii\a) 
should not depend on M,. We also require the smallest possible 'imaginary 
sample* in order for ft: 1 to be a reasonable reflection of minimal evidence 
corresponding to t?=0. Combining these two elements, we see that an 
appropriate choice for a minimal 'calibrating* sample would be to take r=2, 
/i = r+l = 3, which just provides sufficient information about the three 
unknown parameters ptufi2,<^^ Entering these values, together with f?=0, 
^20 = 1, into (12) we obtain c2={2/3)* and hence an explicit form for (12). 
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variance case is treated by Menzefridce (1981). In this section, we shall just 
consider the problem of comparing models of change of mean versos no 
change when there is vague prior knowledge about the model parameters. 
We begin with the simple case of a conditionally independent univariate 
normal s^uence and then generaUze to the multivariate case. This will 
enable us to highlight the problems which arise in calculating B^^ and hence 
(8), when standard improper prior forms of repr^ntation of vague know- 
ledge are assimied. 

Corresponding to (2), with i^j=(npa), we shall assume that, for with 
p(>'J,...,y.|^^ul^2) 

Further, we shall assume a prior density p(^i,^2) of the form 

PiMift^if <^)=p0^i \(y)p(M2\ <r) P(ff) 

- {c,{2naY^}{c2{2n<7Y^}cc- \ (9) 

where Ci,C2,c arc unspecified constants [see, for example, Jeffreys (1961)]. 
It then follows from (4), after some straightforward algebra, that 



OCC 



(10) 



where ynPin-r) denote the mean of yi,...,y, and respectively. In 

the case of A/©, we take p(^i)=p(jii\ar)p{G) and obtain 



p(yi,...,y,|Mo)occicn-* 



«/2 
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We now note that if we take the ratio of (10) and (11) in order to obtain 
B^o from (7), we obtain, after some algebra. 
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where tf is the usual r-statistic for testing the hypothesis Mi=P2> given the 
sampling distribution defined by (9). The form of (12) thus shows a functional 
connection between B^, and hence (8). and classical test statistics for 
comparing means. In particular, is monotonic increasing with tf, and a 
minimum (i.e., showing least evidence of a change at r) when tf—0 
(corresponding to y,=y(»-„). 

However, the unpleasant - and unavoidable - feature of (12) is that it 
depends on the unspecified constant Cj. Note that this problem does not 
arise if we calculate B„, say. since, when we form the ratio of two terms 
defined by (10), the factor CjCjC cancels. Essentially, when we compare Af, 
and Mo, assuming vague prior information, we are dealing with models of 
different dimensionality (two unknown parameters under Mq versus three 
under M^) and arbitrariness necessarily enters into a Bayes factor derived, 
implicitly, from improper limits of proper priors [see, for example, Lempers 
(1971) and Atkinson (1978)], 

In order to motivate a choice for the value of C2 in (12), we use the idea of 
a 'thought experiment*, introduced and developed in detail in Smith and 
Spiegelhalter (1981) and Spiegelhalter and Smith (1982). 

We imagine that we have observed a sample of minimal size to enable us 
to compare Mq and M,, and that this sample has provided maximal evidence 
in favour of M^. This corresponds to assuming that n = r+l and tf=0. In 
such a case, we should wish to have B^o < 1 (evidence favours Afo), but (since 
we have only one observation following r) we should not wish Bro to differ 
much from 1. If we enter the imagined values of n and t^, and take B^o 1 to 
represent our unwillingness to regard such a sample as providing other than 
very minimal evidence, we can deduce immediately fi-om (12) the unspecified 
choice of Cj. 

In fact, we have assumed in (9) that the prior specification for p(jti\<7) 
should not depend on M,. We also require the smallest possible 'imaginary 
sample* in order for B^o^i to be a reasonable reflection of minimal evidence 
corresponding to tj=0. Combining these two elements, we see that an 
appropriate choice for a minimal 'calibrating* sample would be to take r=2, 
n=r+l=3, which just provides sufficient information about the three 
unknown parameters ^i,^2*<r. Entering these values, together with t?=0. 
B20 = 1, into (12) we obtain Cj =(2/3)* and hence an explicit form for (12). 
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where the observation is now a p-vector, and that under we assume 

yi^NJipi,!), i = l...,«, (14) 

In both cases, the sequences of observations are assumed conditionally 
independent For the unknown parameters /Ki,p2>^> invariance arguments 
[Jefi&eys (1961)] lead to improper prior specifications of the form 



where 



p(Ai;|D=c/27i)-W2|Z|- 

p(D=c|2;|-<'+»>/2 



(15) 



(16) 



Using standard integration formulae [sec, for example, Smith and Spiegel- 
haltcr (1981. sec. 17.2.2)], we can easily find the forms of 

pOi. . . j^J M,) = JJ KVi . . . J'o I Ai,, A<2. ^ 

X P(Pi I ^p(/'2 1 ^P(^dfii d/ijdX, 

and 

p(yi.--.yo\Mo)--l[p{yu..^.y„\fiui)p{M,\i)p(i)d^ 

Taking the ratio of (17) and (18), we find that 



where 



(17) 

(18) 
(19) 



(20) 



Arguing as in the previous section, we see that a minimal calibrating 
sample is provided by r=p, »j=2p+l. If this gave so that 

S,+S^„.,f=S„ we should want, approximately, B^=l. Substitution into (19) 
leads to the choice 



Jp(P+ l) Y' 
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Using this value, (19), in conjunction with (8X provides a basis for retro- 
spective inferences about changes of mean in a sequence of multivariate 
normal observations. In a similar way. Bayes factors for changes in cova- 
riance structure can be developed using modifications of the analysis given in 
Smith and Spiegelhalter (1981. sec. 17.3). 



3. Changes of coefTicients in regression roodek 

3.1. Introduction 
We shall consider the regression model 

>;=Jr/'/^'^ + e,, ^=1 n, (22) 

where, at time i, 3^ is the observation on the dependent variable, X( is the 
column vector of observations on p regressor variables (including, possibly, a 
constant term), /f*'^ is the column vector of unknown regression coefficients, 
and £| is a disturbance term, assumed independently normally distributed for 
differing f, with zero mean and constant variance c^. 

The sequence defined by (22) will be said to have a change-point at r 
(lgr<n)if 



with unknown /(, Si=0. We denote this model by M,. The model of no 
change, J=0, will be denoted by Mq. 

If we write K— (yi,..., and denote a realization of Y by then 
adopting the notation 



we see that, under Af,, 



where 



(23) 



(24) 
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whereas, under M o, 

Y^NJiAofi^a'a (25) 
where Aq=X^. 

In (23H25X we have defined p(y\M^e,a) and ply\MoJ,a). To utilize (6)- 
(8) we require 



and 



P(v|M,)=JJp(v|M„^.a)p(a,a)deda. 
P(V I Wo) = Jf p(y I Mo , /r, a)p(fi, <t) d^ do; 



(26) 



(27) 



we thus need to specify p(e,a) and pifi.e). This specification, and its relation 
to the problem of comparing alternative linear models, has been discussed 
extensively in the literature. Recent developments, together with extensive 
references, are presented in Smith and Spiegelhalter (1980) and Spiegelhalter 
and Smith (1982). In the next two sections, we shall consider two particular 
forms of prior specification. 

32. A proper prior specification for 6 

We consider first, following Smith (1980), the consequences of the 
specification 



m<r)^F{0\o)p{ol 



(28) 



where p(<r) = c(7~S and p{d\a) corresponds, for \^r<n, to a normal distri- 
bution with mean and covariance matrix o^^o, where 



In the case of Afo. we simply take Vq= Vq^, ^o = Po- With this specification, it 
is easily verified [Smith (1980, sec. 3)] that if ' may be considered small in 
relation to AjA^, then 



where denotes the usual F-statistic for testing Mq versus M,, 



(29) 
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General discussion of the comparison of linear modeb with informative 
conjugate prior distributions is given in Lempers (1971). Applications to 
change-point problems arc given in Holbcrt and Broemeling (1977). Smith 
and Cook (1980) and Chin Choy and Broemeling (1980a, b). 



J.J. Vague prior irtformation for all parameters 
Returning to (28), we now assume that, under Af ^, 

p{0.a)=p{P\o)p(d\<7)p{a) 

whereas, under Mo, 

p(P,a)=Cf,{2naY'^h<^''^ 



(30) 



(31) 



From (23H25), (30) and (31), it easily foUows [see Spiegelhalter and Smith 
(1982) for a related calculation] that 



Motivating a choice of by means of a calibrating thought experiment' is 
less straightforward than in the cases considered in section 2. A detailed 
argument is provided in Spiegelhalter and Smith (1982, sec. 4) and leads to 
the suggestion 



4. Changes of mean in linear time sefk» models 
4,L General ideas 

We assume that observations z=(Z|,...,2y are available from a doubly 
infinite process which may be modelled, for t = . . - 1, 0, 1, . . ., by the ARM A 
(u,t?) form 

(1-^iB <^^x,-|0=(l-M-"— OrB^t (34) 

as in Box and Jenkins (1970). We then denote by M, the model which 
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assumes that 

and by Mq the model which assumes that 



(35) 



(36) 



Using ideas from Box and Tiao (1965) and Glass et aL (1975), we seek a 
linear transformation from z=(zi,.., 2^ to y==(yi,,,., yj^ such that, under 
Af ^ the random vector Y, corresponding to y, can be described by 



(37) 



where A^=Aj^^,a) is an nx2 matrix, whose entries are functions of r and 
(^^ a')=(0,...^ 4>^, ai,..^ «„), and e=(^, 5)^ Under Mq, we require 



(38) 



where /4o='4o(^.«) is n x 1 and given by the first column of A, (correspond- 
ing to ^=0). 

Before turning to a detailed discussion of the form of such a trans- 
formation, we note the following important relationship with the work of 
section 3. If p(ff,ff), p(P,a) are defined either as in (18), or by (30). (31), with 
p= 1, we can calculate 



and 



p(z I ^, flc) = Jf p(2 1 M „ a, ^, a) p{0, a) dOda. 



P(z|Afo,<A>a)-JJp(z |Mo,ft(7,^.(x)p(/r,a)d^da, 



(39) 



(40) 



where the first terms in the integrals are defined by (37) and (38) multiplied 
by the Jacobian of the transformation, y(^.a), say. If we then integrate (39) 
and (40) with respect to a prior specification /?(^,a) for the ARM A 
parameters, thus obtaining p{z\MX p(z|Afo), the ratio of these latter terms 
gives The resulting form is 



p - JP(»>«)M?'(^>«)-^r(»>«)|'^[W>«)]-"^^p(^,«)d^ 

' JI^(*.«)|/if(^,a)^o(*.«)|"*[5o(«,«)]-'"W.«)d0da ' 



(41) 
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where K^^V^} [if p(tf.<7) is defined by (28)]. or Ka=Ca=y3 [if p(6,a) is 
defined by (30)], and S^^.a), SJi<^,a) denote the usual residual sum of 
squares resulting from least squares fitting of the linear models defined by 
(37) and (38). 

In most applications, u and t; are small and an adequate approximation to 
the integrals arising in (41) is obtained by summation over a suitable grid 
covering the admissible region (with respect to stationarity and invertibility) 
of if>y a. 

The general approach to deriving an appropriate transformation is as 
follows. 

If s = max(u,i?), the first s z/s must be transformed directly to a set of 5 
independent y/s by defining 



^2=^2(^2+^12221) 



(42) 



where the A,'s are chosen to reduce the variance of the y,'s to cr^, and the Jiy's 
are chosen successively in such a way that the y,*s are independent. For 
s<i^n, we define 



yi=^iZi-4>\2i-i-"'-<t>u^i-u-^nyi-\ — "'-'^ivyi-v\ 



(43) 



where the successive choices of the A/s and the t^/s ensures homoscedastidty 
and indep>endence, respectively. 

For AR(u) models, all the terms are 0 and Ai=l for s<i^n. For 
ARMA(u, v) models with u>0, y>0, both the Xy and ki terms are eventually 
dz^ined recursively, at least from i=p+^+l. MA(i;) models lose the it>fi-j 
terms. In the following sections, we provide a detailed development of all 
cases with u + pgZ Ifu + vis large, it may be difficult to identify directly the 
first u + v y/s. An alternative approach is to find the lower triangular matrix 
J"* of the transformation firom (zi,..., z„+J to (yi,.... y.+J by using the 
relation 7I^ = a~^r, where F is the variance-co variance matrix of Zi,..., 
z«+„. It will be seen that the techniques used here are closely related to the 
problem of finding the exact likelihood for ARMA processes [see, for 
example, Newbold (1974) and Galbraith and Galbraith (1974X but whereas 
these authors invert the variance-covariance matrix of a sequence of 
observations, the method proposed here effectively diagonalises it]. 
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4± The AR(I) case 
The required transformation is defined by 

y,=z,-^i2i.i, 1 = 2,. ..,nj 
with Jacobian (1 -0?)* The A, matrix has the form 



where 



(44) 



(45) 



(46) 



(47) 



fll=(l-<^i)*, ^2 = 03— • = a„ = (l-<^,) 
^=^'2 = --=&r = 0. i>,+ i = l. 6,^2=" = 6„ = (1 

¥J. r/ie ^/J(2) case 
The required transformation is defined by 

yi=(yoV)'*zi 

where 

(VoM = (l-W[(l+02){(l-<^2)'"<^f}]-^ Pi=^<t>xa'(t>2)-\ 

with Jacobian (yo/<y^)"Hl -p?)"*- The matrix, (45), is defined by 

fli =(yo^')-* 02 = C(yo/(^'Xi -pf)] -Pi) 



(48) 
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where 
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•■=fc,=(l-0.)J 



(46) 



(47) 



>), is defined by 
tl-Pi) 

(48) 
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4.4. The MA{1) case 
The required transformation is defined by 



where 



i=2,...,rt 



A;=(l+af-T?)-* = (l-aJ')*(i-ai*"''*)"*. 
The Jacobian of the transformation is given by 

and the elements of the matrix, (45), are defined by 

fli^^i, af=Ai(l-T(ai_i), 1 = 2.. ..,n ^ 

6i= •• = 6, = 0, ^ = A((1-Ti6(_i), i = r+l,...,nj 

We can solve (52) to obtain explicit forms, 

%=(l-ai)'\l-«i)*(l-a'/W+ai)-*(l+a'^M-^ 



(49) 



(50) 



(51) 



(52) 



j = l,...,fi. 



fe,^,=(l-a,)-^(I-«',Xl-af'^'*^Xl-a?*'^'r*(l-af^^'^^r*, 

i=l,...,n-r, 

and some simplifications are possible for large i and n. For example, as 
I-+0O, Af-^l, flj->(l— ai)~', 6^+,-^(l-ai)"', Tj-^— a,, and, as n-+oo, 
J(^,«)-(l-a?)* 

^J. The MA{2) case 
The required transformation is defined by 



y( = Af(2, + T(y, _ 1 + ittyt .3), i = 3. . . n J 



(53) 
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where 



and 
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1- 



(54) 



(55) 



The Jacobian of the transformation is given by ... and the elements 
of Af, are defined by 



fl( = 1 + tj^Ji - 1 + 7t,-ai _ 2), I = 3, . . 

*r + i = ^ + .'(l+Tr + (^'r + i-l+'tr + A + (-2X i=l....,n-r 



(56) 



For large values of 1. A^-^^l. Ti-a^, Ti^-^a^, a^-^CI -aj -aj)"', fr,- 
(l-tti-aj) 



4,6. The ARM A{I J) case 
The required transformation is defined by 



where 



Ai=(yo/<T')-*=(a?-2t^,a, + l)-*(l-<^f)* 
T(=-a,Vi» >lf-(l+af-T?)-\ 1=2,. 
ExpUcitly, we have 

Ai=(C-Daf'-^>)*(C-Daf)-* 



(57) 



(58) 



(59) 
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where 



and the Jacobian is given 
J(^.«) = A, ^2. 
The elements of the 

leading to the explicit for 



( 



As i-4oo, Tf-^— ai, fl,-^ 
n-^oo, 

y(^,«)^(i-0 
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and the Jacobian is given by 
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The elements of the matrix are given by 

a^=:Ai, ai=X^\-<t>i-xfii.i\ 1 = 2,.. 

leading to the explicit forms 



^ (1 -»^X1 -ar 'XC-Da^,) + ar ^(1 -ai)(l -al)(l-<t>\) 
' (1 -a,XC-/>af -^>)^<C-Oa?0* 



21 



(60) 



(61) 



(62) 



i=l,..^n 

( 1 - a 1 XC - Da? ('-^ ' - ' 0*( C - Oa f <' 

i=l,...,/i-r J 



(63) 



As iAoo. T,-*-ai, a,->(l -0iX^ -a^) ^ 6,+,-^(l -</>iXl -a^)"^ and. as 
J(^,«)-(l-<^5)ni-af)*(l-<^ia,)-^ 



References 

Atkinson, A.C^ 1978, Posterior probabilities for choosing a regression model, Biometnka 65, 39- 
48. 

Box, G.E.P. and G.M. Jenkinsi, 1970, Time series analysis; Forecasting and control (Holden-Day, 
San Francisco, CA). 

Box, .G.EP. and G.C Tiao, 1965. A diange in the levd of a non-stationary time-series, 
Biometrika 52, 181-191 

Broemeling, LX>^ 1972, Bayesian procedures for detecting a change in a sequence of random 

variables, Mctron XXX, 1-14. 
Broonding, LJ>., 1974, Bayesian inferences about a changing sequence of random variables. 

Communications in Statistics 3, 243-255. 
Chin C3ioy, J.R and L.D. Broemeling, 19$0a, Some Bayesian inferences for a changing linear 

model, Tecfanometrics 22, 71-78. 



i. 



(Page i8 of 18) 




22 



NM. Booth and ASM. SmiiK Retrospectwe identifieatUm qf change-points 



Journal of EooxKi'inetrics 



Chin Choy. JiL and LD. Broemding, 1980b, Detecting the dhange in a linear model Technical 

rqxm no. 14 (Department of Statistics, Oklahoma State University, Stillwater. OK). 
Fenttra. PX., 1975, A Bayesian analyas of a switching regression model: A known number of 

regimes. Journal of the American Statistical Association 70, 370-374. 
Galbraith, R.F. and JX Galbraith, 1974, On the inverses of some patterned matrices ariang in 

the theory of linear tim&^cries, Journal of AppL Prob. 11, 63-71. 
Glass, J.M. Gottman and VX. WiUson, 1975, Design and analyas, of time-series 

experiments (Colorado Associated University Press, fiioulder, CO). 
Harrison, PJ. and CF. Stevens, 1976, Bayesian forecasting (with discussion), Journal of the 

Royal Statistical Society B38, 205-247. 
Holbeit, D. and LJ>. Broemehng, 1977, Bayesian inference related to shifting sequences and 

two-phase regression. Communications m Statistics 6, 
Jeffreys, H., 1936/1961, Theory of prohaWHty (Oxford University Press, Oxford). 
Lee. A.F.S. and S.M. Heghinian, 1977. A shift of the mean level in a sequence of independent 

nomial random variables - A Bayesian approach. Technometrics 19, 503-506. 
Lempers, F.B., 1971, Posterior probabDities of alternative linear modeb (Rotterdam University 

Press, Rotterdam). 

Menzefirtckc, U.. 1981, A Bayesian analysis of a change in the predsioD of a sequence of 
independent normal random variables at an unknown time point. Applied Statistics 30; 141- 
146. 

Newbold, P., 1974. The exact likelihood function for a mixed autoregressive-moving-average 

process. Biomctrika 61, 423-426. 
Smith, A.F.M., 1975, A Bayesian approach to inference about a change-point in a sequence of 

random variables, Biometrika 62, 407-416. 
Smith, A.F.M., 1977, A Bayesian analysis of some time-varying models, in: J.R. Barra et aU 

Recent developmoits in statistics (North-Holland. Amsterdam) 257-267. 
Smith, A.F.M, 1980, Oian^point problems: Approaches and applications, in: J.M. Bernardo et 

aL, eds., Bayesian statistics (Valencia. University Press, Valencia) 83-98. 
Smith, A.F.M. and D.G. Cook. 1980, Switching straight lines: A Bayesian analysis of some renal 

transplant data. AppUed Statistics 29. 180-184. 
Smith. A.F.M. and U.E Makov, 1980, Bayesian detection and estimation of jumps in linear 

systems, in: O.LR. Jacobs et al.. eds.. The analysis and optimization of stochastic systems 

(Academic Press. London) 333-346. 
Smith. A.F.M. and DJ. Spiegelhaltcr, 1980, Baycs factors and choice criteria for linear models, 

Journal of the Royal Statistical Society B42, 213-220. 
Smith. A.F.M. and D.J. Spiegelhaltcr, 1981, Bayesian approaches to multivariate structure, in: V. 

Bamett. ed. Interpreting multivariate daU (Wiley, Chichester) 335-348. 
Smith, A.F.M. and M. West, 1982, Monitoring renal transplants: An application of the multi- 
process Kahnan filter. Submitted for publication. 
Spiegelhaltcr, DJ. and A.F.M. Smith. 1982, Bayes factors for Imear and log-linear modeb with 

vague prior information. Journal of the Royal Statistical Society B44, forthcoming. 



f 

f 

f 

r 

r 



BAYESLU" 
PARAME 



1. IntrodQctiOD 

The gamma distri 

/(x;a)=- 

where kisa. known 
widely used as a mo 
practical examples ii 
Under certain cot 
occurs at least once 
is (1). In other won 
Xji+i,...,x, of indep< 

JCi-/(Jc;( 

where / corresponds 
X is the parameter tl 
take values between 
therefore ^^^^j. 

Under the previo 
first is the detection 
change point and t 
change has occurred 

Several authors 
problems for the n 
Bacon and Watts ( 
Holbert and Broemi 



0165-7410/82/0000- 



This Page is Inserted by IFW Indexing and Scanning 
Operations and is not part of the Official Record 

BEST AVAILABLE IMAGES 

Defective images within this document are accurate representations of the original 
documents submitted by the appUcant. 

Defects in the images include but are not limited to the items checked: 

^BLACK BORDERS 

□ IMAGE CUT OFF AT TOP, BOTTOM OR SIDES 

□ FADED TEXT OR DRAWING 

□ BLURRED OR ILLEGIBLE TEXT OR DRAWING 

□ SKEWED/SLANTED IMAGES 

□ COLOR OR BLACK AND WHITE PHOTOGRAPHS 

□ GRAY SCALE DOCUMENTS 

□ LINES OR MARKS ON ORIGINAL DOCUMENT 
^P^FERENCE(S) OR EXHIBIT(S) SUBMITTED ARE POOR QUALITY 

□ OTHER: 

IMAGES ARE BEST AVAILABLE COPY. 
As rescanning these documents will not correct the image 
problems checked, please do not report these problems to 
the IFW Image Problem Mailbox. 



